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1.  Introduction. 

Let  X  be  a  p -dimensional  random  vector  variable  with  dispersion 
matrix  I  .  Two  well-known  measures  of  multidimensional  scatter,  obtained 
by  generalizing  the  variance,  the  univariate  measure,  are  E  and  the 
generalized  variance  (GV),  J  E  J  =  det(E),  introduced  by  Wilks  (1932  ,1967).  For 
multivariate  normal  populations,  Likelihood  Ratio  Tests  (LRT's)  for  E’s, 
of  course  of  same  dimensionalities,  and  some  optimum  properties  of  these 
tests  are  known.  But,  when  multidimensional  scatter  of  populations  of 
different  dimensions  need  to  be  compared,  these  tests  cannot  be  defined. 
However,  using  j  E  |  ,  which  we  will  nomenclature  as  Standardized  Gen¬ 

eralized  Variance  (SGV) ,  such  comparisons  become  meaningful.  Since  |E| 
represents  the  volume  in  p-dimensions,  note  that  j  5T  [  ^ ^ becomes  a  mea¬ 
sure  so  scaled  as  to  become  comparable  with  scatter  for  a  scalar  random 
variable.  Apart  from  this  generality,  need  for  tests  of  generalized  vari¬ 
ances  has  been  also  felt,  on  its  own  right.  |E|  ,  being  a  scalar,  is 
more  suitable  and  easier  to  work  with  than  the  matrix  E  .  Hoel  (1937)  was 

1This  research  was  partially  supported  by  National  Science  Foundation 
Grant  SES79-13976  at  the  Institute  for  Mathematical  Studies  in  the 
Social  Sciences,  Stanford  University. 
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probably  first  to  realize  this  need  and  later  Eaton  (1967)  studied 
some  problems  of  statistical  Inference  associated  with  a  single  GV. 

Blckel  (1965)  and  others  have  used  "generalized  variance  efficiency"  for 

comparing  various  test  procedures.  The  GV  has  been  extensively  used  in 
applied  research,  e.g.,  by  Goodman  (1966)  in  Agricultural  Statistics, 
Gnanadesikan  and  Gupta  (1970)  in  Ranking  and  Selection,  Arvanitis  and 
Afonja  (1971)  in  Sample  Survey,  Kiefer  and  Studden  (1976)  in  The  Theory 
of  Optimal  Designs,  etc.  While  the  estimation,  e.g.,  van  der  Vaart  (1965), 
Shorrock  and  Zidek  (1976),  and  the  distribution,  e.g., 

Mathai  (1972)  have  been  studied  In  some  detail,  little  seems  to  be 
known  about  tests  for  GV's.  This  paper  attempts  to  bridge  that  gap. 

Suppose  X^'s  are  independently  distributed  as  Np  (U^*^)  >  ^ 

being  general  dispersion  matrices,  i  =  l,...,k  .  LRT's  are  derived  for 

1/pi  2  1/pi 

Hqi  :  | E±|  -  aQ  (given)  >  0,  for  some  fixed  i;  HQ2  :  [ij 


1/p, 

| Dj i  ,  for  some  fixed  i  and  j  and  finally 


1/p. 

]Ei|  all  equal. 


i  »  l,...,k  against  appropriate  two-sided  alternatives.  The  test  cri¬ 
teria  turn  out  to  be  quite  elegant  multivariate  analogues  to  those  in 
the  univariate  cases. 

The  solutions  to  the  distributional  problems  associated  with  the 
various  test  statistics  considered  above  need  extensive  use  of  Special 
Functions.  The  exact  distributions  for  both  the  null  and  nonnull  cases 
are  presented  for  most  of  the  above  test  criteria.  The  percentage  points 
of  these  distributions  can  be  obtained  from  existing  mathematical  tables 
since  the  distributions  are  represented  in  suitable  computable  forms. 
Examples  of  construction  of  tables  and  the  general  procedure  of  obtain¬ 
ing  them  for  such  computable  forms  of  the  distributions  exist  in  current 
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literature,  e.g.,  Mathai  and  Katiyar  (1979).  Further,  many  of  the  existing  table 
can  also  be  exploited  to  give  the  percentage  points.  Large  sample  approxi¬ 
mations  to  the  above  distributions  are  also  presented. 

Finally,  considering  general  E^s  equal  dimensions,  for  HQ1 
and  Hq2  »  it  is  shown  that  the  "modified"  LRT's  are  unbiased — a  result 
parallel  to  Sugiura  and  Nagao  (1968)  on  tests  of  covariance  matrices. 

2.  Applications. 

In  addition  to  the  mathematically  interesting  nature  of  the  problem 
and  the  applications  cited  above,  there  lies  a  rich,  fertile  area  for 
numerous  applications  of  the  SGV's.  In  fact,  wherever  variance  is 
employed  for  univariate  situations,  SGV's  seem  to  be  applicable  for  the  multi¬ 
variate  situations  of  'overall'  variability.  Some  examples  are  cited  below. 

(a)  Multivariate  Quality  Control.  It  is  well  known  [e.g.,  see 
Steyn  (1978)]  that  testing  H^:  the  population  mean  vector  y^_  of 

X  ,  X^  ~  N^U^.E),  remains  constant  during  the  sampling  process  against 

the  alternative  that  yr  varies  during  the  process,  is  equivalent  to 

testing  H^:  GV  *  |E]  against  GV  *  )E*|  ,  where  E*  =  (I+2D/n)E, 

D  *  £  y  y'  E  ,  n  being  the  sample  size.  One  of  the  many  applications 

r=l  ~r  ~r 

of  this  result  and,  hence,  test  for  SGV  can  be  seen  in  multivariate  quality 
control. 

(b)  Generalized  Canonical  Variable (GCV)  Analysis.  When  the  p-corponcnt 
original  vector  can  be  divided  into  k  >  2  mutually  exclusive  groups, 

Anderson  (1958),  Problem  5,  pp.  305-306,  proposed  GCV's  to  be  obtained  by 
minimizing  their  GV.  Steel  (1951)  and  Kettenring  (1971) (as  in  his  Ph.D.  thesis) 
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have  attempted  to  construct  such  GCV's.  Sen  Gupta  (1980a)  constructed 
GCV's  which  are  equicorrelated  and  formulated  and  studied  some  inference 
problems  associated  with  such  GCV's.  However,  no  results  on  statistical 
inference  associated  with  the  GCV's  obtained  by  Anderson-Kettenring- 
Steel's  met  ho’  are  available.  If  the  original  vector  can  be  meaning¬ 
fully  subgrouped  into  <  k£  <  . . .  <  kg  groups,  then  one  is  faced  with 
the  problem  of  choosing  between  canonical  variables  with  dimensions 


k^,  i  *  l,...,s  .  For  the  k^-th  subgrouping,  the  original  vector  is 

split  into  k.  mutually  exclusive  and  exhaustive  classes,  the  j-th 
1  kt 

class  consisting  of  p.  elements,  £  p.  =  p,  i  =  l,...,s  .  Obviously, 

J  j=l 

since  the  criterion  of  minimum  GV  is  being  used  to  represent  the  original 


vector  by  a  smaller  dimensional  one,  a  GCV  with  the  smallest  dimension 
k^  ,  will  be  preferred  provided  they  all  have  the  same  comparable  GV's, 
i.e.,  the  same  SGV's.  Also,  it  may  be  meaningful  to  represent  the  ori¬ 
ginal  vector  by  different  GCV's  of  the  same  dimensions,  k  ,  but  whose 


components  are  derived  from  different  subgroupings  of  the  original  vector. 


Gnanadesikan  (1977),  pp.  74-77,  drawing  from  a  well-known  example  in 


psychometry,  considers  three  (=k)  sets  of  scores  by  several  people  on 


three  batteries  of  three  tests  each,  i.  e. ,  p^  =  =  p^  =  3  .  Ttie  three  tests 

in  each  battery  were  intended  to  measure,  respectively,  the  verbal,  the 


numerical,  and  the  spatial  abilities  of  the  persons  tested.  He  commented, 
"An  interesting  alternative  analysis  in  this  example  (...)  would  be  to 
regroup  the  nine  variables  into  three  sets  corresponding  to  the  three 
abilities  measured  rather  than  the  three  batteries  of  tests."  If  the  prob¬ 
lem  is  considered  in  the  more  general  set-up  that  the  observed  values 
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merely  constitute  a  sample  from  the  underlying  population,  then  how 
profitable  the  suggested  alternative  analysis  will  be  must  be  judged 
through  related  statistical  tests  of  significance.  Furthermore,  similar 
problems  for  a  combination  of  the  cases  considered  above,  i.e.,  where 
one  is  faced  with  the  problem  of  comparing  several  GCV's,  some  of  which 
may  possibly  differ  in  dimensions,  are  quite  important  from  practical 
considerations.  Such  statistical  problems  as  comparing  GCV's  of  possibly 
different  dimensions  obtained  by  Anderson-Kettenring-Steel ' s  method  can 
thus  be  formulated  as  tests  for  SGV's. 

(c)  Generalized  Homogeneity  of  Multidimensional  Scatter.  Dyer  and 

Keating  (1980)  were  interested  in  the  homogeneity  of  variances  of  sealed 
bids  of  five  Texas  offshore  oil  and  gas  leases.  The  variances  were  com¬ 
puted  from  the  biddings  of  the  oil  companies,  whose  numbers  and  identities 
varied  from  lease  to  lease.  In  similar  situations,  if  repeated  observa¬ 
tions  are  available  for  the  leases  or  if  the  variabilities  (for  the 
bidding  patterns)  for  certain  companies  for  repeated  bids  over  possibly 
different  numbers  of  leases  need  to  be  compared,  then  one  is  faced  with 
the  problem  of  testing  homogeneity  of  "variances,"  which  we  will  term 
"Generalized  Homogeneity"  for  vector  variables  of  possibly  different 
dimensions.  This  will  be  equivalent  to  testing  homogeneity  of  SGV's, 
which  is  precisely  defined  above. 

(d)  Missing  Observations  on  a  Random  Vector.  In  cases  of  missing 
observations  on  part  of  a  random  vector  [Dahiya  and  Korwar  (1980),  Goodman 
(1968)],  generalized  homogeneity  as  considered  above  can  be  quite  useful 
for  comparing  multidimensional  scatter.  Suppose  for  an  agricultural  product 
P,  there  are  q  important  characteristics  with  regard  to  variability. 


5 


Let  the  complete  observation  vector  be  X  =  (X^,...,Xr),  X^:  *  1, 

r 

i  ■  1 . r,  £  q  ■  q.  Let  observations  be  available  from  population 

1  1 

j  on  only  tj  £  r  of  the  X^'s,  say,  with  no  loss  of  generality,  on 
X  *  (X1,...,X  ),  j  *  l,...,k.  Since  usually  for  an  agricultural 

j  ti 

product  P,  the  overall  variability  of  the  product  can  be  represented 
in  terms  of  the  volume,  one  can  use  GV  as  a  measure  of  this  overall 
variability.  In  dealing  with  populations  of  maize  and  cotton,  this 
notion  was  reflected  by  Goodman  in  his  concluding  remark:  "The  six 
cotton  populations  indicate  even  more  clearly  that  the  generalized 
variance  is  a  useful  measure  of  overall  variability  that  merits  futher 
investigation."  Let  it  be  desired  to  compare  the  multidimensional 
scatter  of  P  over  the  k  populations .  Due  to  missing  observations 


one  possibility  is  to  consider  the  random  vector,  say  X  ,  u  -  Min  t. 

j  J 

for  which  observations  are  available  from  all  the  populations  and 
compare  the  GV's  of  X^.  This  was  the  approach  of  Goodman's  analysis 
where  "Only  plants  for  which  complete  data  were  available  were  used  for 
these  analyses."  This  approach  however  results  in  loss  of  data.  The 


other  and  better  alternative,  in  case  the  missing  observations  are  in 

the  format  described  above  would  be  to  compare  the  SGVs  of  the  X 

r  J 

[If  measurements  on  the  components  are  in  different  units,  we  can  consider 

the  standardized  scatter  coefficient,  where  scatter  coefficient,  defined 

by  Frisch  (1929),  is  the  determinant  of  the  correlation  matrix].  This 

seems  reasonable  since  more  characteristics  related  to  the  variability 

* 

of  P  is  considered  through  X  's. 

3 

(e)  Ranking  and  Selection.  The  method  of  ranking  and  selection 
based  on  GV s  as  proposed  by  Gnanadesikan  and  Gupta  (1970)  for 
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equi-dimensional  vector  variables  has  a  natural  extension  to  an  initial 
test  for  SGVs  for  vector  variables  differing  in  dimension  followed  by 
ranking  and  selection  procedures  based  on  the  SGVs. 

(f)  Cluster  Analysis.  Often  in  cluster  analysis,  after  the  clusters 
are  identified  the  question  of  homogeneity  of  clusters  is  of  vital  impor¬ 
tance,  as  in  Mezzich  and  Solomon  (1980).  Tests  for  SGVs  can  be  exploited 
advantageously  for  such  circumstances. 

3.  Likelihood  Ratio  Tests  for  SGV's. 

Let  X  'V  Np(y,E).  Throughout  our  discussion,  unless  otherwise  stated 
we  will  assume  £  to  be  non-singular.  Denote  the  population  SGV  of 
X,  |l|^p  by  A2  and  that  of  the  sample,  js/N|^p  by  d2  where  S 
is  the  sample  sums  of  products  matrix  based  on  a  sample  of  size  N.  Also 
denote  |s|^p  by  s2.  [Note  that,  Anderson  (1958)  defines  GV  with  the 
divisor  N-l  instead  of  N] . 


3.1.  Test  for  a  Specified  Value  of  SGV.  Let  x ^ , . . . ,xN  be  a  random 

2  2 

sample  from  N^(y,I)  and  suppose  we  want  to  test  H^:  A  *  (specified) 
2  2 

against  H^:  A  5*  °q  •  (Note  that  Hq  is  equivalent  to  the  hypothesis 
that  the  GV,  |s|  ,  has  the  specified  value  a2p) .  Since  the  Hq  does  not 
strain  y,  we  have  y  *  x.  To  find  the  MLE  of  2  under  HQ,  consider 


*  -  in  C  +  [f  In  0t  -  +|  Jin  s2p  -  Jin  8±  -  Jin  p2p  , 

where  C  ■  (211)  Np/,2|s|  N^2,  X  Is  the  Lagrange  undetermined  multiplier, 
9^*  i  ■  1,  ...  ,  p  are  the  characteristic  roots  of 

and  we  have  used  the  fact  that  |z|^p  ■  p2  is  equivalent  to 

8  P  “  \  "  *>n  ^qP*  s2p  ■  | S |  .  Differentiating  4>  w.r.t.  0^ 

and  equating  to  zero,  we  have  N  -  X  -  0  ,  i  -  l,...,p.  So, 


con- 
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(N-X)p  -  s2p/cr2p  ♦91  -  s2/a2  ,  i*l,...,p  . 

Hence, 

L/L  »  (LaN^2  exp(-p/2  a*^P)  *  f(a),  say,  where  =  (e/N)np^2  and  a»s2p/a2p  . 
co  Q 

However , 

f(a)  t  a  <  Np  and  4-  a  >  Np  . 

So,  we  get, 

2  2  2  2 

Result  1.  The  LRT  for  HQs  A  -  0Q  against  H^  A  J  aQ  can  be 
equivalently  given  by. 

Reject  HQ  iff  d2p/a2p>aQ  or  <  a1  , 

where  a^  and  a^  are  constants  to  be  determined  from  the  specified  level  of 
the  test. 


3.2.  Test  for  the  Equality  of  the  SGVs  of  Two  Independent  Multi¬ 
variate  Normal  Populations.  Let  xy,  u  -  1,...,N^  and  yy,  v  =  1,...,N2 

denote  two  Independent  random  samples  from  Np  (y^.Zj),  i  ■  1,2  respec- 

2^2  22 
tively.  We  are  interested  in  testing  Hq:  A^  “  A^  against  H^:  A^  1*  A2» 

2 

where  A^,  i  ■  1,2  are  the  population  SGVs  of  X  and  Y  respectively. 

As  in  Section  3.1,  y^  -  x,  y^  ■  y.  Let  0^,  i  ■  l,...,p^  and  ri  , 
j  -  l,...,p2  be  the  characteristic  roots  of  X^S^  and  re8Pec~ 

tively  where  S^,  i  ■  1,2  are  the  sample  sums  of  products  matrices  for 
X  and  Y  respectively.  For  finding  the  MLEs  of  E^s  under  H^,  consider 
as  before. 
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where  C  is  a  constant,  A  is  the  Lagrange  undetermined  multiplier, 

2p  ...  1/Pl  .  ,1/P2 

*  =  |s  |,  i  *  1,2  and  we  have  used  the  fact  that  |EjJ  =  [ E2 I 
is  equivalent  to  the  quantity  within  the  third  bracket  on  the  right-hand 
side  of  (3.1)  being  zero. 

Differentiating  w.r.t.  0^  and  n^s  and  equating  these  to  zeros, 
we  have 


'  2Pl  2Pl]1/pl 

**  A  *  p^0^  $  1*  1)  •  •  • ) p^  * ^  ”*  ^  S  /a0 


P2^2  ^  *  3  *  •  >p2  P2^2  ^  ”*  ^2  s2  ^0 


2p,  2p2|1/f2 


These  give  0^  -  (pjS*  +  P2S2^^P1N1  +  P2N2^  *  ^plNldl  +  p2N2d2  ^P1N1  +  P2N2 
2 

where  d^  ,  i  =  1,2  are  the  sample  generalized  variances  for 
X  and  Y  respectively  and  -  A 2  ~  a2,  the  common  unknown  value.  Note 
that  Og,  the  MLE  of  0*  agrees  with  that  in  the  case  of  Px  “  P2  “  1* 

Also  0QPl  and  a*2  give  the  MLEs  of  |Z1|  and  j^j,  the  generalized 
variances  of  X  and  Y  respectively.  Using  these  estimates,  the  Likelihood 
Ratio  criterion  is  given  by. 


(3.2.1) 


l/a  ‘  H 


2)Vl/2(s2)N2P2/2l  j(a^)  (N1P1+N2P2)/2 


C.  a 


N2P2/2 


(P1a+P2) 


(NlPl+N2p2)/2 


=  f(a),  say,  where  a 


2  2 

s^/s2  and  C  is  a  constant. 


However,  f(a)  +  a  >  N2p2  and  +  0  <  a  <  0  N2p2/Nipi  •  Tllus 

Result  2.  The  LRT  for  H^:  A2  =  A2  against  H^:  A2  4  A2  can  be 
equivalently  given  by. 


2  /j2 


Reject  Hq  iff  R  *  d1^d2  <  or  >  r 


where  r^  and  r2  are  constants  to  be  determined  from  the  specified 
level  of  the  test. 


3.3.  Test  for  the  Equality  of  SGVs  of  k(>  2)  Independent  Multi¬ 
variate  Normal  Populations.  Let  ^  *  1,...,N  ,  i  =  l,...,k  denote 

k  random  samples  from  k  independent  populations  ,  i  »  1,..., 

respectively.  We  are  interested  in  testing  Hq:  A^  ,  i  »  l,...,k 
all  equal,  against  the  alternative  H^,  that  at  least  one  of  them  differ. 


k 


As  before,  under  both  Hq  and  ,  jj  ■  ,  i  “  l,...,k  .  Let 

,  i  •  1 . k,  j  ■  l,...,p^  be  the  characteristic  roots  of 

respectively  where  S^,  i  ■  l,...,k  are  the  sample  sums  of  products 
matrices  for  ,  i  »  l,...,k  respectively.  For  finding  the  MLEs  of 

,  i  •  l,...,k  ,  under  Hq  ,  consider  as  before, 
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(3.3.1) 


k  Pi 


*•0+1  l 

1-1  j-1  1 


1  1 
-4  in  e . .  -  4  e . 


ij  2  Ijj 


1=1 


lx  , 

' 

"  1  2 
Z  —  Jin  8  -  Jin  a  . 

2  ii+1 

b 

j  Pt  ij  i 

Z  — —  2-n  9  .  -Jin 

[j  *1+1  1+lj  1+1 


where  C  Is  a  constant  and  X  are  undetermined  Lagrange  multipliers 

with  k+1  being  replaced  by  1  in  the  suffixes.  Differentiating  <J> 
with  respect  to  an<*  equating  to  zeros  we  have. 


piKi+<xn+r\-ii)  ■  »ieij  ;  1*1 . k-  J"1 . pi  •  xoi  ■  V 


*  v  ■*  ey  -  sl/5o  •  1  ■  1 . k  ■ 


*2  2  2 
where  is  the  MLE  of  (TQ  ,  the  common  unknown  value  of  A^  ,  i=l,...,k. 

So 


50  E  piNi  +  S0  IJ1<Xii+l*Xi-li)1  “EpiSi 


S0  "  E  Pisi/E  piNi  ‘ 

.i.  i 


Note  again  this  agrees  with  the  MLE  for  0^  of  the  univariate  case. 
Hence,  we  get. 


Result  3.  The  LRT  for  H^:  A^  ,  all  equal,  against  H^:  at  least 
2 


one  of  the  A^  ,  i  -  l,...,k  ,  differ  is  given  by 

k 

n 

i=l 


k  N  p  /2 

Reject  Hq  if  and  only  if  ,  n  -  II  (d^/a^)  11  < 


11 


r 


where  Dq  is  a  constant  to  be  determined  from  the  specified  level  of 
the  test. 


4.  Exact  Null  and  Non-Null  Distributions  of  the  Test  Criteria. 


4.1.1.  Definitions  and  Decisions.  In  order  to  obtain  the  exact 
distributions  under  null  and  alternative  hypotheses,  for  the  test 
statistics  considered  in  Section  3,  we  recall  the  following  definitions 
and  preliminary  discussions  from  Mathai  [(1970),  (1972),  (1973)]. 

Meijer's  G-Function.  The  G-function  is  defined  as. 


(4.1.1) 


Gm’n  \z  ai”“’ap  =  (2  tt  i) —1  h(s)  z"S  ds  , 

p,q  t.  b1,...,b]  Jl 


where  i  -  (-l)  ,  z  is  not  equal  to  zero  and 


2 

z  =  exp{s  (£n|z)  +  i  arg  z)} 


in  which  £n  |zj  denotes  the  natural  logarithm  of  jz|  and  arg  z  is 
not  necessarily  the  principal  value,  and 


n  ra  q  p 

h(s)  -  n  r (i  —  a  —  s)  n  r(b  +s)/{  n  r(i-b  -s)  ir  r(a,  +  s)} 

j»l  j=l  ^  j=m+l  J  j-n+1  ^ 


where  m,  n,  p,  q  are  integers  such  that,  0  S  n  S  P  ,  l^.m<_q 


a^(j  ■  l,...,p)  ,  b^  ( j  *  1»  •  •  •  » 9) 


are  complex  numbers  such  that 


(bh  +  v)  ^  (a  -  1  -  r)  ,  for  v,  r  =  0,1,..., 


L  is  a  contour  in  the  complex  s-plane  such  that  the  points, 


(4. 1.2) 


-s  =  (bj+v)  ,  j  -  1, . . .  ,m  ;  v  =  0,1,... 


and 


-s  “  j  =  1 . n  5  V  “  °*1— * 

are  separated  and  the  points  in  (4.1.2)  are  enclosed  by  L  .  An  empty 
product  is  interpreted  as  unity.  The  function  (4.1.1)  makes  sense  in 
the  following  cases 

For  every  z  1*  0  if  q  -  p  is  positive  and  for  0  <  |z|  <1 
if  q  -  p  =  0  .  The  existence  of  different  contours  L  is  discussed 
in  Erdelyi  (1953,  p.  207). 

The  Gauss-Legendre  Multiplication  Formula. 

(4.1.3)  T(mz)  •=  (27T)(1-m)/2  mm2-1/2  II  T(z  +  j/m)  ,  m  -  1,2,...  . 

j-0 

This  formula  (4.1.3)  enables  one  to  write  a  Ganma  of  an  integral 
multiple  of  z  in  terms  of  Gamma  of  z. 

The  technique  of  inverse  Mellin  transform  can  be  advantageously 
employed  when  a  moment  sequence  uniquely  determines  a  distribution. 
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Rao  (1973,  p. 106)  has  given  a  number  of  sufficient  conditions  for  the 


unique  existence  of  the  density  function.  If  y  s  -  0,1,... 

is  the  (s-l)th  moment  about  the  origin  of  T,  uniquely  determining 
its  density,  then  the  density  g(t)  of  T  is  given  by  the  inverse 
Mellin  transform 


(4.1.4) 


g(t)  =  (2  tt  i) 


C+ioo 

C+ioo 


U 


s-1 


-s 

t 


ds 


where  i  =  /^T  and  C  is  a  suitably  chosen  real  number.  For  multi¬ 
variate  normal  populations  it  is  seen  that  at  least  one  of  the  condi¬ 
tions  cited  in  Rao  is  satisfied  by  the  moment  sequence. 

A  computable  representation  of  the  G-function  of  (4.1.1)  using  the 
Calculus  of  Residues  is  given  by. 


(4.1.5) 


sm,n 

p>q 


ai . aP 

i  h^, . . . ,b^ 


n 


.  n  m.  -1 

t  r  r  jrr-v 


=  l  l  A  +  I  I  l  l  R 

k=l  j=l  J  r*=l  j»l  v—  1  i»0 


where  A^  is  given  in  (5.21)  and  Rvi  is  given  in  (5.27)  of  Mathai 
(1970,  p.  141). 

Plncherle's  H-Function.  The  G-function  is  a  special  case  of  the 
most  general  Special  Function,  namely  the  H-function.  Braaksma  (1964) 
has  discussed  the  H-function  in  detail.  Following  Mathai,  slight  modi¬ 
fications  of  the  original  definition  are  made  here  in  order  to  represent 
it  as  an  inverse  Mellin  transform.  We  define  the  H-function  as  follows. 
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(4.1.6) 


_  n  "  | (a  ,a) 
H(z)  -  H®>n  z  1  1  P  P 

p,q  L  '(bl,B1),...,(bq,3q)J 


m 


{. 


n  ra>, +  fls)  n  r(i-a.-aAs) 


n»l 


J  "j 


J=L 


j  “j 


2tt1  j  q  p 

n  r(i-b  -m)  n  r(a  +a,s) 
j-m+1  3  3  j*=n+l  3  3 


“8  j 

z  ds 


where  1  *  /~1  ,  p,  q,  m,  n  are  integers  such  that  1  £  m  ^  q,  0£n_<p 
(j  *  l,...,p)  ,  g  (j  ■  l,...,q)  are  positive  numbers  and 
a^  (j  *  l,...,p)  ,  b  (j  ■  l,...,q)  are  complex  numbers  such  that. 


Oj(bn+v)  ^  0h(aj-l-X)  ,  for  V,  X»  0,1 . .  h  «  l,...,m  ;  j  »  1 . n 


L  is  a  contour  separating  the  points. 


(4.1.7)  -s*>  (b^+v)/^  ,  j  -  l,...,m;  v  -  0,1,, 


and 


(4.1.8)  -s  -  (aj-l-v)/aj  ,  j  -  l,...,n  ;  v  -  0,1,...  . 


One  condition  of  existence  of  the  H-f unction  is  that  it  exists,  for 
every  z  ^  0  if  u  >  0,  where 


V  "  l  “  l  “j 

1  3  i  J 


and  for  0  <  |z|  <  g  ^  ,  if  y-0  where 
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P  a,  q  -0, 

g  =  rr  a  J  n  g  J  . 

i  J  i  j 

The  result  due  to  Braaksma  [p.  278,  (6.1)]  which  effectively 

says  that  H(z)  is  available  as  the  sum  of  the  residues  of  h(s) 

_s 

z  in  the  points  (4.1.7)  is  not  affected  by  the  modification  mentioned 
earlier  of  the  original  definition  of  the  H-function.  A  computable 
representation  of  the  H-function,  with  the  detailed  method  of  identify¬ 
ing  the  poles  is  given  in  Mathai  (1973)  as  follows. 


m 

(4.1.9)  H(z)  -11*. 

j-1  J 


where  the  second  sum  is  over  .  defined  in  (4.10)  and 

J 1  Jm 

defined  in  (4.22)  of  Mathai  (1973). 


R. 

3 


is 


4.2.  Exact  Distribution  of  d^/0QP  .  Since  the  sample  generalized 
variance  d^p,  arise  frequently  in  many  multivariate  tests,  various 
authors,  e.g.  Mathai  (1970),  Consul  (1964),  have  worked 
on  its  distribution.  Mathai's  work  seems  to  be  quite  suitable  in  view 
of  the  distribution  that  will  be  needed  for  Section  4.3.  Letting 

n  r\ 

v*  d  P/0qP  and  using  (4.1.4),  the  density  of  v,  g(v),  under 
Hq  ,  can  be  written  as 


(4.2.1) 

where 


g(v)  -  C(2wi)' 


n  r[(N-j)/2+h]v  ds 
j-1 


C  GP,”(v|(N-j)/2  ,  j  =  1,2,. ..,p)  ,  0  <  v  < 


c"1  -  5j_  r[(N-j)/2] 
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A  computable  representation  of  g^(v)  is  then  obtained  by  substituting 
m  *  p  ,  n  ■  0  ,  p  =  0  ,  q  «  p  in  (4.1.5) 

Explicit  evaluations  for  the  residues  needed  in  (4.1.5)  are  given 
in  Mathai  (1972).  For  this,  Mathai  considered  a  random  variable  with  its 
h-th  moment  given  by, 

(4.2.2)  E(xh)  =  (-)  n  IL(srAL/,2±hJ.  n  riln.iA)/2-hJ 

a  i-1  r[(n-i)/2]  i-1  r[(m-i)/2] 


Assume  the  moment  sequence  in  (4.2.2)  uniquely  determines  the  density  of 
X.  Then,  through  the  use  of  Calculus  of  Residues  he  obtained  the 
following 

Theorem  Ml.  The  density  function  f (x)  corresponding  to  the  moment 
sequence  in  (4.2.2)  is  as  follows. 


f (x)  =  C(ax/8)m/2  (p+1)/2x-1  l  (R  +R')  ,  x  >  0  . 

j=l  3  3 

a.-l  . 

1  ,  j  a  -1  a 

R.  *  [a  -l)!]"i(ax/8  )3  l  (  3  ) (-log  ax/3)  J 

J  J  r»'o 


a . -1-r 


r-1 


-  *  _  ,  (r-l-r.)  1  1  r-1  (r  -l-r2) 

x  (  l  (  r-3)  X  A  1  l  (  r  )A  1  }B  » 

r^O  rl  r2-0  r2 

Rj  is  Rj  with  (ax/8)^  ,  a^  ,  A^  ,  B  are  replaced  by  (otx/fj)^  *^2, 
b^  ,  A'(t)  and  B'  respectively.  The  various  quantities  for  the  two 
cases  p-even  and  p-odd  are  given  in  pp.  166-167  of  Mathai  (1972). 

For  v  ,  the  moment  sequence  given  in  (4.2.2)  determines  the 

-  0  , 


density  uniquely  according  to  Rao  (1973,  p.  106).  Letting  p' 


a  ■  1  ,  3  =  2P ] E ] /°qP  and  m  =  n  ,  the  density  of  v  is  deduced  in 
a  form  suitable  for  computation.  Using  the  specified  values  for  |z| 
under  the  null  (3  =  2P)  and  alternative  hypotheses,  the  corresponding 
distributions  of  v  are  obtained. 

4.3.  Exact  Distribution  of  R.  Because  of  its  frequent  applications 
in  various  multivariate  test  procedures,  distribution  of  the  ratio  of 
two  independent  sample  GVs  have  received  much  attention,  e.g.  Nandi 
(1977),  Tretter  and  Walster  (1975),  Mathai  (1972)  etc. 

Case  1.  =  p^.  The  test  statistic  R  will  be  equivalent  to  the 

test  statistic  based  on  the  ratio  of  two  independent  GVs,  say,  w,  only 

when  p^  =  p2  *  p,  say.  In  that  case,  the  density  of  w  =  RP  is 

2  2 

obtained  from  Theorem  Ml  above  by  letting  a  =  1/A^  and  3  *  l/Ai  . 

7  2  2 

Under  Hq  a/3  =  1  .  Under  any  alternative  H:  A^/ 05  6  ,  the 

2  2  2 

distribution  of  R  is  obtained  by  substituting  a/3  =  A2/A^  “  1/6 
in  Theorem  Ml. 

Case  2.  p^  1*  p^.  In  the  case  of  unequal  dimensions,  the  distribution 
of  R  is  not  available.  We  obtain  the  distribution  in  terms  of  the 
H-function  and  present  it  in  a  computable  form  through  the  use  of  Calculus 
of  Residues.  Now, 

h  P1  p2 

(4.3.1)  E(Rh)  *  c  (a?/a?)  n  r{|(N. i)}  n  r4(N.-— -i)} 

•  12  1  2  1  1  22  P2 

where 

k  p2 

c  -  [ji  r((N  -i)/2}  n  r((N.-i)/2}]  . 

1  1 
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Hence  from  (4.1.4),  on  using  (4.1.6),  the  density  of  R,  g2(r),  under 


Hq  ,  can  be  written  as 


(4.3.2)  g2(r)  =  (2iri) 


U 


E(Rh)  r“h  dh 


-  C,  •  H 


|  »Pl)  »  *  *  »  »  ) 

r  1  1  p2’pl 


P1’P2 

P2 ,P1  L  l(b1,p2),...,(b 


P1,P2 


,  0  <  r  < 


Since  here  the  a^s  and  the  ft^s  in  (4.1.6)  corresponding  to  (4.3.1) 
are  all  rational  numbers,  we  can  use  (4.1.3)  to  express  g2(r)  in  terms 
of  G-function  also.  However,  use  of  H-function  here  is  a  more  direct 
and  convenient  approach. 

In  order  to  present  (4.3.2)  in  a  computable  form  we  introduce  the 
following  notation.  Consider  (4.1.6).  The  poles  of  r(bj+£jS)  are 
given  by  the  equation 


(4.3.3) 


-s  =>  (bj+v)/B  ,  v  =  0,1,...  . 


If  the  point  -s  =*  (b^+v^)Bj  for  some  V  =  e  {0,1,...}  coincides 
with  the  poles  coming  from  y-1  other  Gammas  of  the  set  r(b^+BjS), 
j  *  l,...,m  then  the  point  gives  a  pole  of  order  y  .  In  order  to 
distinguish  poles  of  all  orders,  for  a  fixed  j  ,  consider  the  equations 


(4.3.4) 


,w<i»  b+v(J2> 

1  Jl-J,  2  Jl-J. 


b,+v 


b  +V 


(jm) 


m  J  i  •  • *J. 


m 
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We  interpret  (4.3.4)  as  follows.  For  a  fixed  j  ,  j  =0  or  1  for 

r 

r  -  1 . m  .  If  j  =  0  ,  then  (b^+vpr^  .  )/ST.  is  to  be  excluded 


,(jj) 


r  J  1  *  *  ’3  r 

l  m 


from  (4.3.4)  v'JJ/  .  is  a  value  of  v  in  (4.3.3).  For  every  fixed 
3l-"3m 

j  ,  +•••+  j  e  {l,...,m}  denotes  the  order  of  the  pole  at 

-s  ■  (b.+vf33^  )/(?..  To  avoid  duplication  we  will  use  the  convention, 

3  3 1*  *  *3m  3 

while  considering  the  points  corresponding  to  (4.3.3),  that 


31  =  32  =  =  3j-l  =  °* 

Denote  by  S^33^  .  the  set  of  all  values  v.^33^.  takes  for  given 

3 1 ‘ ‘ * Jm  31‘ ’ ‘3m 


(jj)  (jj) 


31,32** ‘ ’ ,3m’  i>e-  Sj, 


{v.  .  }  .  Then 

3  1  *  ‘  '  3m 

i  m 


m 


(4.3.5) 


H(z)  =  l  l 


j=i  s:j3)  .  3 

j . . . .  j 

l  m 


— «s 

where  R  is  the  residue  of  h(s)z  at  the  pole  -  s»(b  +  v.  .  )/$. 
J 

^  i  1  ^  1  m 

, ,  ,  v  denotes  the  summation  over  all  sets  S.w~  .  "'''.notes 

s(jj)  _  3 1 *  *  * 3 ^ 

31.  .  .3n 


(jj) 


the  summation  over  all  sets  S 


(jj) 

j  -  . . .  j 
J1  Jm 


.  More  detailed  discussions  can  be 


found  in  Mathai  (1973). 

We  now  determine  the  poles  and  their  corresponding  orders  needed 
in  (4.3.5),  i.e.  only  the  first  product  of  gammas  in  (4.3.1).  It  will 
thus  be  convenient,  for  computational  purposes,  to  choose  p^  <  p£. 

N.-i  . 

For  a  fixed  i  ,  the  poles  of  T[ — r—  +  — ]  »  T  are  given  by  the 

z  i 

equat ion 


-s  -  p1((N1-i)/2  + v)  ,  v  -  0,1,2,... 
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Note  that  poles  of  and  coincide  only  when  i  and  j  are  both 

even  or  both  odd. 

Lemma  4,3.1.  The  poles,  with  their  corresponding  orders,  are  given  by 
Case  A.  odd. 

N  -1 

(4.3.6)  {v^oiq. . . ioi^  *  f0,l,2,...}  »  (v)  .  Poles  are  p^-j; — +  v)  » 

pl+! 

repeated  — ^ —  times  . 


{v0CK ^ 0101  •  •  101}  =  {0}  *  Pole  is  ’ 

j-1 

Pl+^-  A  _1 

repeated  — - - times,  j  *  3,5, ...,p 


1-1 


1  " 


N,  -2 


^0101*  • -010*  =  {O*1*2****}  =  {^*  Poles  are  Px (~ — +v), 

Pl-1 

repeated  — —  times  . 


Ni-J 


{v0j.^010..010}  “  {0}  '  Pole  is  pl(~2  }  » 

j-1 

Pl”^  1-2 

repeated  — ^ - — 2~  times  ,  j  *  4,6,...,p^-l 


Case  B.  p^  even. 


N,-l 


(4.3.7)  {viJJo---10}  “  (0,1,2,...)  -  (v).  Poles  are  Pl(-~+v)  , 


repeated  —  times  . 
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{vo^oio..io} " {vl*  Pole  is  PiHr0  * 

pi  j  _i 

repeated  —  -  times,  j  *  3,5, . . . ,p^-l  . 

(22)  V2 

^V0101- • *01^  *  {0,1,2,...}  *  {v}  .  Poles  are  p1(— ~ — +v)  , 


repeated  ■ 1 

2 


times 


{voo».i..oiI  '  {oK  Pole  ls  • 

P1  i-2 

repeated  —  -  times,  j  *  4,6,, 


■  • 


For  i  j  j  ,  is  vacuous  unless  i  and  j  are  both  odd  or 

both  even.  (We  omit  the  subscripts  of  since  it  is  clear  what  they 

are.) 


ft  1 

(4.3.8)  (vv  s  =  {v  +  ~2~)  «  v  ■  0,1,2, ...;£  >  1  is  odd  .  Poles  are 
identified  with  those  of  . 

{v<2£)>  *  (v  +  ^y^}  ,  v  *  0,1,2,...;®.  >  2  is  even.  Poles  are 

(22) 

identified  with  those  of  v  '  . 

{v<£  “  — 2 ~  ,  i  >  t'  >  2  ;  t't  both  odd  or  both  even. 

Poles  are  identified  with  those  of  . 

Proof.  The  above  results  follow  from  the  following  observations. 
Consider  Case  A.  Let  i  »  1  and  j  £  be  any  odd  number.  Then, 
poles  of  rL  and  r  coincide,  as 

Nr1  Ni"J  i+i 

Pl(-y-  +  V)  -  Pl(-±—  +  »■+  (v, A)  -  {(0,  ....  } 
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But  this  set  excludes  the  poles  coming  from 
Consider  j , j '  both  odd,  3  £  j  <  j ’  £  . 


N,-j  N  -j  • 

(4.3.9)  Pl(-±—  +  A)  =  p^-^y-  +  A ’)  .  For 


A  e  (0,1,...,  -  l) 

Then 


=  E. 
J 


Thus  considering  the  'excluded  sets'  E^s  we  note  that  the  smallest 
element,  i.e.  0  ,  is  repeated  in  all  succeeding  ,  through  the 
relation  (4.3.9).  This  establishes  (4.3.6).  A  similar  argument  holds 
for  (4.3.7).  (4.3.8)  follows  from  the  definitions  of  the  corresponding 

sets. 


Theorem  4.3.1.  The  probability  density  function  of  R  is  given  by 


for  p^  odd, 


I  00 

,  (r)  =  C  £  (r/62) 

•[v=0 


V1  -1 

P1(~-  +  v)  (pyD/2  1 


l  f  (r/6  ;  u.s^Aq.Bq) 

u=0 


Pi+1 

.  2  P1<N1-j)/2  J— V-l 

+  l  (r/62)  r  "  ,r2 


j 


l  f (r/6  ;  u,a  ,A0,B0) 

u=0  J 


Nr2  -i 

»  Px( — 2~+ v)  (prl)/2 

+  l  (r/62)  l  f  (r/62  ;  u,b  ,A  ,B  ) 

V-0  u=0 


+  W62) 

j 


Pl(Nrj)/2  1^.-1 


l  f(r/6  ;  u,b  ,A0,Bq) 

u**0  J 
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and  for  p^  even 


g2(r) 


Nr1  PX 

Pl(  2~+v)  ~1"1 

C  l  (r/62)  l  f  (r/62  ;  u.a^.A  ,B  ) 

v=0  u=0 


+  l*  (r/62) 


pi(Nrj)/2 


l  f (r/6  ;  u,a’  A  ,Bq) 

u=0  J 


N  -2  p 

Pl(-|-  +  v)  -i-1 

+  l  (r/62)  l  f (r/62  ;  u,b, ,An>Bn) 

V=0  u=0 


p  (N  -j)/2 

fj  ^  ^  O 

+  l  (r/6  )  £  f (r/6  ;  u,b  ,A0>BQ) 

j  u=0  J 


where 


V1' 


l  0  !  un_1_u  f  U~1  u-1 

f(rju,d,A  ,B  )  -  lu  J  (-log  r>  I  ( 

V  0  u  Iyj^-0  1 


Y.-l  ^l-1  (Y,-l-Y?) 

x  xr  A  A 

I  t2  ao 


u„  P1 


B  *  (s-d)  ®  H  r{-|-(N..  +  — -  i)}  at  s  *  d 
U  ,  ^  i  p. 


where  d  is  a  pole  of  order  u^,  (the  upper 
limit  +1,  for  u  in  the  summation  in  the 
theorem)  of  the  product  of  the  gamma  functions 
defined  by  Bq  . 


“SI  108  ®0  ’  "  -  ° 


2  2  2  a  aa 

C  is  the  constant  defined  in  (4.3.1)  and  6  =  ^^2  •  Ej  and 

jlf  ^ 

denote  the  summations  over  all  j  e  {3,5, ...,p^}  and  j  z  {4,6,...,p^  ) 
*  * 

respectively,  p^  =  p^  if  p^  is  odd  and  p^  =  p^-1  if  p^  is  even; 
** 

p^  =  if  p^  is  even  and  p^-1  if  P-^  is  odd. 


Proof.  The  proof  follows  by  noting  (4.3.5)  and  combining  Lemma  4.3.1 
above  with  Lemma  1  of  Mathai  (1973). 

Finally,  a  convenient  computational  form  of  the  p.d.f.  of  R  is 
obtained  from  the  following  theorem  proved  as  Theorem  1  in  Mathai  (1973). 

Theorem  M2.  H(z)  is  given  in  (4.3.5)  where, 


(4.3.10)  Rj  - 


(b.4v<j^  )/6. 

+  Z  1  m  ^1+  1 


1  l 

r=0 


J 


x  (-log  z) 

rr1 

*  l 

r  *0 


-1_r 

1  m 


V1 


r  r-1 

y 

r-l 

1 

rr° 

^rl- 

v  •••] 

(r-l-r1) 


where  the  C^s  and  D^s  are  defined  in  (4.23)  and  (4.24)  of  Mathai  (1973) 
We  note  that  the  sets  .  }  are  not  needed  for  h  <  j  in 

II  •  •  •  J 

J 1  Jm 

(4.23)  and  (4.24)  of  Mathai.  Thus,  Lemma  4.3.1  gives  us  all  the  desired 
sets  needed  to  use  Theorem  M2  above,  which  expresses  H(z)  in  terms  of 
the  convenient  computable  functions,  e.g.  the  Psi  and  the  generalized 
zeta  functions.  Examples  of  computation  of  H(z)  is  given  in  Section  5 
of  Mathai  (1973).  Also  computational  procedure  and  computer  programs 
for  calculating  the  percentage  points  of  the  distribution  of  R  can  be 
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obtained  in  a  manner  similar  to  Mathai  and  Katiyar  (1979).  The  null 

2  2  2 

distribution  is  obtained  by  putting  6  =  ^]/^2  =  ^  *n  ^*3*1)  and  the 

nonnull  distribution  by  substituting  the  specified  value,  under  the  given 
2  2  2 

alternative,  5  =  A^/A ^  in  (4.3.1).  It  is  known  that  for  p.^  *  1  or  2  , 

2  2  2  2 

Xj  "  P j  n  j  u j  ^ j  where  n  j  =  Nj  "  1  and  n  j  u j  =  N j  d j  *  J  *  1 » 2  is 

2 

distributed  as  ax  with  d.f.  p.(n  .-p.+l)  .  Hence  if  p.  =  1  or  2  , 

J  j  J  i 

p^  not  necessarily  equal  to  p^  ,  the  exact  distribution  of  R  ,  under 
both  the  null  and  alternative  hypotheses  are  obtained  as  central  F,.  r 

distributions,  with  obvious  multipliers,  having  d.f.  given  by 
=  Pi(n1~Pi+l)  ,  i  ■  1,2  . 


4.4.  Exact  Distribution  of  U  •  We  consider  a  Bartlett  type  modi- 

2  2  2 
fication  for  n  .  Let  Xi  *  p^^  nA  u^A^  where  n ^  -  1  and  ^ 

2 

=  Nf  df  ,  i  *  l,...,k  .  As  in  the  univariate  case,  we  propose  the 
modified  test  statistic 


,  k  ,  ,  "i^Vi//  k  .  /k  \ 


(4.4.1) 


a1  nx 

D 


i1/  ih 


where 


bi  “  (Vi^Vi*  •  cb  “ 


For  P,  -  1  or  2  , 
representation  (4.4.1)  we  get 


Using  this  result  and  the 
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pj 


Theorem  A. 4.1.  For  =  1  or  2  , 
i^j,i,j  *  1 . k  the  exact  density  of 


not  necessarily  equal  to 
is  given  by 

O 


f(t)  -  [rw/n  «»3>]  (  J  »*J) 

•  £(2ir)(k~1)/2/  r((k-l)/2)j  tm-1(-logt) 


(k-3)/2 


nm  D  K(t)  , 

ID)  3  )  D 


o  <  t  <  1 


where , 


m  =  J  p. (n  -p  +l)/2,  a  «  P  (n  -P  +l)/2m,  j=l,...,l 
ill  j  J  J  j 


bj  ,  j=l,...,k  are  defined  in  (A. 4.  ) 

and  n  ,  is  defined  in  Theorem  2  of  Ghao  and  Glaser  (1978). 

m,a,b 

Percentage  points  and  approximations  to  the  above  distribution  are 

obtained  from  Dyer  and  Keating  (1980).  For  pi  >  3,  the  distributions  of 
2  2 

n  or  n  seem  to  be  complicated. 

B 


5.  Large  Sample  Approximations.  Some  large  sample  approximations  to 
the  exact  distributions  of  the  test  criteria  considered  above  are  now  sug¬ 
gested.  Existing  approximations  are  also  reviewed  for  the  distribution  of 
GV  and  SGV. 


2  2 

5.1  Asymptotic  Distributions  of  GV  and  SGV.  Letting  nu  =  Nd  , 
n  =  N  -1,  we  have  from  Anderson  (1958),  Theorem  7.5.4,  that  for 
large  N, 

(u2p/A2p  -1)  i  N(0,2p) . 

P 

It  is  know  that  n  =  NPv  =  |s|/jz|  is  distributed  as  jEi*N-i 
2 

where  the  x  's  are  all  independent.  Hoel  (1937)  suggested  approximating 

1/P 

the  distribution  of  i)  -  w  by  the  distribution  with  the  density  function 
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J  p(N-p)  4  p(N-p)  -  1  -Cw 


r [ i/2P (n-p) ] 


where , 


C  =  C  (p  , N )  t  1  -  ] 

This  turns  out  to  be  exact  for  p=l  and  p=2. 

Gnanadesikan  and  Gupta  (1970)  have  suggested  approximating  the  distri- 
1  1  ^  2 

bution  of  lnw=  —  In  n=  —  V  In  x  ,  using  the  central  limit  Theorem,  by 
P  P  L  N-i 

the  normal  distribution. 

We  now  propose  a  new  approximation  to  the  distribution  of  SGV.  An  ap¬ 
plication  of  the  general  result  of  Madansky  and  Olkin  (1969)  with 
i/p 

h(v)  -Jv|  shows  that,  for  large  N, 

./N(d2/A 2  -  1)  -*  N(0,2/p) . 

In  the  light  of  this  approximation  to  the  distribution  of  the  SGV,  it 
is  interesting  to  note  the  approximation  to  the  distribution  of  GV  by  Anderson 
stated  at  the  beginning.  Little  is  known  about  the  relative  performances 
of  the  four  approximations  discussed  above. 

5.2  Asymptotic  Distributions  of  R  and  n .  Letting  c^  =  0(1^, of 
Hoel's  approximation,  for  large  N1  and  N  ,  the  density  of  R  can  be  ap¬ 
proximated  by  that  of, 


where  5 2  «  A2/&2  ' 


C  N  p  (N  -p  )  - 

2  2  111  6  F 

QiWW  VW-WV- 


The  null  and  non-null  distributions  are  obtained 


by  putting  6  ”1  and  the  specified  value  under  the  alternative  hypothesis, 
respectively. 


1 


2 

In  addition  to  the  usual  x  approximation  to  the  likelihood  ratio 
2 

criterion  n  ,  another  approximation  is  presented  here.  If  is  large  com- 
2 

pared  to  p_^,  i=l,...,k  then  in  the  same  lines  of  Hoel's  approximation,  we  get 
2  2  2 

Xi  =  can  approximated  by  a  x  variable  with  d.f.  p^(N^-p^)  , 

i=l, . . . ,k.  Hence, 

2 

Lemma  5.2.1.  If  is  large  compared  to  p  ,i=l, . . . ,k,  then  the  density 

2 

of  n  under  Ho  can  be  approximated  by  f(t)  defined  in  Theorem  4.4.1  (where 
B 

p^s  now  can  be  any  integers,  not  necessarily  Is  or  2s  only). 

2 

Similar  result  is  seen  to  hold  for  n  also.  These  approximations  are 

2 

expected  to  be  better  than  the  usual  x  approximations. 

2 

6.  A  Multivariate  F  Criterion  A  simpler  statistic  than  n  is  now 
_ max _ . 

suggested  for  the  special  case  when  we  have  an  equal  number  of  observations, 
N,  from  k  populations,  each  of  equal  dimension  p,  e.g.. 


F 

p  ,max 


=  d2  /  d2 
max 


min 


For  p=l,  this  coincides  with  the  F  proposed  by  Hartley  (1950)  as  a 

2 

short  cut  method  for  the  univariate  case.  It  is  known  that  In  xv  ,  for  large  v, 

2 

is  approximately  normal  with  variance  -2/(v-l).  Hence, In  d  is  approximately 

percentage  points  of  F  can  be  determined  from 

IH3X 


normal,  for  large  N,  with  variance 


_  1  I  2 _ 

"  P2  j-i  ^ 


Thus  the  approximate 


F 

p,max 


(a) 


where  r^fa)  is  the  lOOaX  point  of  the  range  r,  in  independent  normal 
samples  of  size  k.  Tabulated  values  of  t^(a)  are  available  from  Pearson 
and  Hartley  (1966). 

7.  Example.  The  following  example  is  taken  from  Gnanadesikan  and 
Gupta  (1970)  who  were  interested  in  a  ranking  and  selection  procedure 
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« 


based  on  generalized  variance.  They  considered  5(=p)  -  diminesional  sum¬ 
maries  of  speech  spectrographic  data  from  a  talker  identification  problem. 
The  data  consisted  of  7(=N)  replicate  utterances  of  10 (=k)  words  for  one 
particular  speaker.  Then, 

1/5 

F  =  (720616.4465/1.5411)  =  13.6137 

p,max 

and  F  (.01)  =  9.0737 

p,max 

Hence,  the  hypothesis  of  equal  multidimensional  scatter,  as  measured 
by  SGV,  is  to  be  rejected. 

8.  Unbiasedness  of  the  modified  LRT's  for  and  • 

The  results  in  this  section  show  that  tests  for  SGV's  possess  the 
property  of  unbiasedness  as  do  the  corresponding  tests  for  covariance 
matrices.  We  will  consider  the  modified  LRT's  obtained  by  replacing  the 
sample  size  by  the  degrees  of  freedom  n^  =  JL  -  1  in  the  original 

LRT,  and  unbiasedness  of  these  tests  for  and  for  Hq2  in  the  case 

of  equidimensional  vector  variables  will  be  established.  The  proofs  for 
the  covariance  matrices  were  given  by  Sugiura  and  Nagao  (1968). 

8,1,  Unbiasedness  of  modified  LRT  for  . 

Theorem  8.1.1.  For  testing  Hqj:|Z|  =  OqP  against  the  alternative 
H^^:|Z|  4  cfgP  for  unknown  y  ,  the  modified  LRT  given  by  replacing  N 
by  n  *  N  -  1  in  the  original  LRT  is  unbiased. 

Proof :  For  given  >  0  ,  let  b^  =  {Z:  |Z|  =  OQP,Zp.d.}  , 

S1  ‘  «'  l£l  *  ^P.  Ep-d->. 

Then 


To  prove  Th.  8.1.1  ,  it  is  then  enough  to  show  that  for  any  Z  fc  b^  ,  say 
Iq  fc  Sq  and  for  any  Z  fc  ,  say  Z^  fc  b^  , 
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(8.1.1)  P(w|H01,E0)  -  P(a)|Hn,Z1)  <  0 


where  is  the  critical  region  of  the  modified  LRT,  i.e.. 


(8.1.2)  a):  S/S  is  p.d.  and 


-Np/2 


expj-p/2  I S 1  /cTqP]  | 

<*] 


>-Np/2 


.3)  =w:js|s  is  p.d.  and  [|]  |z-ls|(N-l)/2  exp  j--P|  Z^S  | 1/p  J 

<  ca  VZ0  fc  S0  ’  1-*”  l£ol*°0PJ- 


This  is  so  because  if  (8.1.1)  is  true  for  any  arbitrary  (Z  ,E  )  fc  (S^&p  , 

then  it  is  also  true  for  any  arbitrary  (E^.E*)  fc  (Sq,S^)  or  (Z^.Z^) 

fc  (fep,^)  and,  hence,  this  will  imply  P(u)/H01)  -  P(uj,H^)  £  0  . 

Let  gfcO(p),  the  multiplication  group  of  pxp  orthogonal  matrices,  be  such 

that  g  is  a  diagonal  matrix  T  where  E^^2  =  (^q^2)  * 

1/2  1/2 

Zq  Zq  =  ZQ  .  Without  any  loss  of  generality  we  can  assume  that 

Eq  =  I  and  Z^  =  T  ,  the  diagonal  matrix  where  diagonal  elements  are  the 

-1/2  -1/2 

characteristic  roots  of  Z^  E^Eq  *  Hence  under  H^,S  he®  a  Wishart 
distribution  with  parameters  T  and  n  =  N  -  1  .  Now 


P(w|H11)  =  c  jsj(N-p-2)/2  jr|-(N-l)/2  exp{_  I  tr  f^sMs 


c  | u|  CN-p-2) /2  i  tj.  u}du 

,*  n.P  2 


where 


c-1  -  TTP(P"1)/4  2np/2  n  T( (n-i+1) /2)  , 

n>P  -t.i 


u  -  r'1/2  sr_1/^2 


u>'  -  {U:  U  p.d.,  r1/2  ur1/2  fc  0)}  . 
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Also 


If  I  -  |rr(p+1,/2 


Since  a)'  «*  u>  when  is  true  and  in  the  region  w  ,  letting 

0^,i  =  l,...,p  be  the  characteristic  root  of  U, 


f(U)  =  |U| 


(N-p-2) /2  exp(_  1  tr  0)  =  ^ 


(N-p-2)  /2  exp(_  1  ZQj 


we  get 


Further, 


<  c'  U 
—  a  ' 


<  c' 
—  at 


~(p+l)/2 
Np/2 


r  \ Np/2  r  (  ^0 .  , 

(f) 


uf(p+1)/2  -  g(U)  , 


|  y  |  (N-P-2)  l  tr  y^y 


OJ-wnai 


exists  and 


g(U)dU  <  «> 


ujnu) 


w-uinai 


,  f  (U)dU  _< 


oi-oinw 


,  g(U)dU 


and 


Thus,  we  get 


[  *  ,  f (U)dU  <  -  [  ,  n  *  g(U)dU 

J  cj^-ojnto1  7  —  J  co  -oiHcj* 


P(u)|H01,E0)  -  P(wiH11,E1) 


-cjnw* 

) Np/2 | 


-c  /f 

n,p|Jw- 


f(U)dU 


co^—wncu* 


,}i»i 


-(p+l>/2 


dU 
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-  v4r{w>'-'-'2 


The  last  inequality  is  due  to  the  fact  that  |u|  is  the  invariant 

measure  in  the  space  of  U  under  the  full  linear  group  G^(p)  transform¬ 


ing  U  -*•  gUg'  ,  g  fc  G^(p)  ,  that  is. 


,uj-(p+l)/2du 


i-(p+D/2, 


|uJu|''r - d  U.  Q.E.D. 

Precisely  this  property  was  exploited  by  Sugiura  and  Nagao  to  prove  the 

unbiasedness  of  the  modified  LRT's  for  £  =  £Q  and  ^  «  £ 

Theorem  8.1.1  can  be  generalized  to  the  k-sample  case.  Let  x^:  p*l 

1-1, ...,N  (N  >p)  be  a  random  sample  from  N  (p,£.),  j=l,2,...,k  .  Let 
3  3  P  J 

S  be  the  sample  sums  of  products  matrix  and  n.  =  N.-l  .  Using  the  same 
3  3  3 

argument  as  in  the  proof  of  Theorem  6.1.1  we  have  the  following  theorem. 

Theorem  8.1.2.  For  testing  the  hypothesis  :  1 2^.  [  =  0^. 

(j”1-,2,  •  •  •  >k)  against  the  alternatives  +  Oq  for  some  i  , 

2 

where  the  mean  VL  is  unspecified  and  <7^  >  0  is  given,  the  modified  LRT 
having  the  critical  region 


u)’ : 


(S^ , • • • , S^) /S .  is  p.d .  (j— 1 , • • . k) 


and 


k 

n 

j-i 


ft 


ftp/2  r 


V2 


exp  < 


1  1  > 
1  s  | 

I/P'1 

><  c" 

/  v  c 

1  a 

Oj 

)  J 

is  unbiased. 
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Consider  further  the  problem  of  testing  ^  against 


H"  :|Z|1/P  >  0? 


^  where  0  <  (^  <  .  As  a  modification  of  the  LRT  in 


Section  3.1  consider  the  test  4>q, 


4>n:  ReJect  H"  iff  !sl/onP  >  cn 


Theorem  .1.3.  For  testing  against  ,  the  test  <})  is  the 
uniformly  most  powerful  invariant.  Also  (J>^  is  a  maximum  test  and  is  most 
stringent. 

Proof.  See  proof  of  Proposition  1  of  Eaton  (1967). 


8.2.  Unbiasedness  of  the  modified  LRT  for  [  |  =  |  |  .  Let 

^  Np<Ul,Zl)’  i,Bl»***,Hl  and  ~j  VM2,E2)’  j=l»---»N2  denote  two 
independent  random  samples  from  two  independent  multivariate  normal  popula¬ 
tions.  For  testing  HQ2  :  ]  | 1/p  =  |E2|1/p  (or  equivalently  1^1" 
against  H12 :  |£  t  l^l1^  »  the  critical  region  of  LRT  is  given  by, 

from  (3.2.1) ,  as 


<  c'  ,  a  constant 


f  21 

r  2) 

K 

ill 

(8.2.1)  w  -  \(si»S2):  2>(N1+N2)p/2  <  Cd 


Sl+S2 

VN2 


where  s^K  ■  |S^|,  ,  i«l,2  are  the  sample  sums  of  products  matrices 

for  X  and  Y  respectively. 


Theorem  8.2.1.  The  modified  LRT  for  H^2  |Z^| 


|Z2|  against 


Hl2:  I ^*1 1  ^  1^1  obtained  by  replacing  by  n^  «  Nj-1,  i“l,2  in 

(8.2.1)  is  unbiased. 


Proof.  As  in  Section  8.1,  let  Sq  »  {Z: (E1,E2> : | zj  =  |z2|  ^.Z^.d.} 
and  Sx  -  {E:(E1>E2):|E1|  ^  |E2|;EltE2p.d.}  .  Then,  SJLS^  4>  .  To  prove 
Th.  8.2.1  it  suffices  to  show  that  for  any  ZeSQ,  say  Zq,  and  for  any 
ZeS^,  say  Z^, 

p(u|ho2,eo)  -  p(<o|h12,z1)  <  0  . 


Consider  any  Z  ■  (Z^,Z2)  . 

Without  loss  of  generality,  we  can  take  m  I  and  Z^  *»  8  ,  the  diagonal 
matrix  with  diagonal  elements  8^ . 8^  . 


P(o)|H12)  =  cr 


‘1* 


c 

P  n2,p 


(SlfS2)fea) 


!si 


(n1-p-l)/2 


|S5 


(n2-p-l)/2 


|e| 


-nx/2 


exp 


H 


tr(0_1S1+S2)> 


dSxdS2 


c  c 

n^,p  n2’P 


<n-p-l>/2  (n,-p-l)/2 

lDr|  1 0,1  2 


(i,u2)eu> 


~ni  ^  f  i  1 

je  j  1  exp|-  J  tr(6  J-+U2)U1VdU1dU2 


(I,U2)to) 


(n,-p-l)/2  -n./2  ,  /, 

|u2|  2  | e !  |(e  ^upl  n/2du2 


where  ■  U^,  S2  *  uJ/2u2oJ/2 

that  U  *  U^2!^2  and  b  ■  c 
111  n 

Jacobian  of  the  transformation 


1/2 

with  U^'  a  symmetric  matrix  such 

c  /c  ,  n  -  n.+n.  .  The 
x,p  n2,p  n,p  1  2 

(Sj^.Sj)  -*•  (UX,U2)  is  given  by 
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1 


3(S1,S2)  .  ^  i(p+l)/2 

3<UlfU2)  |U1' 

1/2  1/2 

Put  V  ■  0  0  7  .  Let  Li*  be  the  set  of  all  pxp  positive  definite 

-1/2  -1/2 

matrices  V  such  that  (1,8  V  0  )fc  go  ,  and  oj  be  the  set  of  all 
pxp  positive  definite  symmetric  matrices  V  such  that  (1,V) fc  u)  .  Then, 


P(w|H02)  -  P(u|H19)  -  b 


12" 


Li 


Li* 


(n  -p-l)/2  , 

|V|  2  |(I+V)|  /2dV 


lvl”2/2 1 |vf<P+«/2dv  . 

Now  consider  the  following. 

Lemma.  If  and  S2  are  two  positive  definite  matrices  of  the 

same  dimension,  p*p  ,  then 


=  b< 


Li-LiHLi* 


CL)*-wna)* 


Proof  1  Since  and  S2  are  two  p.d.  matrices,  there  exists  a 

nonsingular  matrix  M  such  that  MS^M'  *  I  and  MS2M'  ■  A  “  diagonal 
where  <5^  >  0,  i*l,...,p  are  the  characteristic  roots  of 
S2S~*  .  To  prove  the  lemma,  it  is  enough  to  show 


1 


^Professor  Olkin  has  pointed  out  that  this  can  be  proven  alternatively 
through  the  concavity  of  |s|^**. 
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(3) 


m 


{|MS  M'  |1/p+|MS2M'fL/p}P  <  |MS1M’+MS2M' 
i.e.  {|l|1/p+|Al1/p}P  <  { | I+A| } 


L.e.  1 1 +  n  6  1/p>  <  n  (1+6  ) 

l  1=1  1  >  1=1  1 


1/p 


which  is  ttrue  by  Holder's  Inequality.  Hence  the  lemma. 


p<“Ih02>  "  -  bca  I  [-  - 


L  ;  0)“U)00)* 


10*0)00)* 


|  ivr 


•Cp+D/2 


dV 


bc'j  "  \  }lVl"(P+1)/2dV  =  0  * 


since 


_|v ! 

(jj 


(n  -p-l)/2  , 

|(I+V) |  /2dV  <  « 


and  for  any  subset  uj'  of  oj 


(n  -p-l)/2  , 

|v|  2  |(I+V) |"n/2dV  < 


(n„-p-l)/2 

I  ^ 


0)' 


_(i+|v|1/p)P] 


-n/2 


dV 


<  c' 
—  a 


(by  the  Lemma) 

|v  |-(p+l)/2(Jv  <  .  q.e.D. 


w 


3 
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9.  Remarks .  As  with  any  multidimensional  measure,  the  SGV  cannot 
be  expected  to  be  the  unique  measure  best  for  all  situations  of  multi¬ 
dimensional  scatter.  However,  if  we  are  interested  in  'overall'  scatter 
and  where  magnitude  of  individual  variances  separately  are  not  of  great  con¬ 
cern,  the  SGV  can  be  expected  to  perform  adequately.  Wilks  (1967)  gives  an 
expository  account  of  GV  as  a  measure  of  multidimensional  scatter  from 
geometrical  standpoint. 

This  paper  suggests  several  topics  for  future  research.  Admissibility 
of  the  LRTs  may  be  studied.  Also,  alternative  test  procedures  to  LRTs,  e.g., 
by  the  Union-Intersection  method  will  be  interesting.  Sequential  and  non- 
parametric  test  procedures  for  SGVs  may  provide  further  insights  into  the 
problem.  The  case  of  one  sided  alternatives  and  singular  dispersion  matrices 
seem  to  be  of  great  practical  importance  and  interest.  Finally,  one  can 
explore  situations  where  SGVs  may  not  be  adequate  and  for  such  situations 
provide  adequate  measures  and  tests  for  multidimensional  scatter. 
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